Model Selection

Inference Acceleration

# Inference Acceleration

mera-mix-4x7B is a Mixture of Experts (MoE) model with half the scale of Mixtral-8x7B but comparable performance and faster inference speed.

Large Language Model

Prosparse Llama 2 7b

A large language model based on LLaMA-2-7B with activation sparsification, achieving high sparsity (89.32%) while maintaining original performance through the ProSparse method

Large Language Model

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase